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Abstract: Embryonic stem (ES) cells are characterized by the expression of an extensive and interconnected network of 
pluripotency factors which are downregulated in specialized cells. Epigenetic mechanisms, including DNA methylation 
and histone modifications, are also important in maintaining this pluripotency program in ES cells and in guiding correct 
differentiation of the developing embryo. Methylation of the cytosine base of DNA blocks gene expression in all cell 
types and further modifications of methylated cytosine have recently been discovered. These new modifications, putative 
intermediates in a pathway to erase DNA methylation marks, are catalyzed by the ten-eleven translocation (Tet) proteins, 
specifically by Tetl and Tet2 in ES cells. Surprisingly, Tetl shows repressive along with active effects on gene expression 
depending on its distribution throughout the genome and co-localization with Polycomb Repressive Complex 2 (PRC2). 
PRC2 di- and tri-methylates lysine 27 of histone 3 (H3K27me2/3 activity), marking genes for repression. In ES cells, al- 
most all gene loci containing the repressive H3K27me3 modification also bear the active H3K4me3 modification, creating 
"bivalent domains" which mark important developmental regulators for timely activation. Incorporation of Tetl into the 
bivalent domain paradigm is a new and exciting development in the epigenetics field, and the ramifications of this novel 
crosstalk between DNA and histone modifications need to be further investigated. This knowledge would aid reprogram- 
ming of specialized cells back into pluripotent stem cells and advance understanding of epigenetic perturbations in cancer. 
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INTRODUCTION 

Embryonic stem (ES) cells are isolated from the inner 
cell mass of the early-stage blastocyst and can be character- 
ized by their pluripotency and capacity for self-renewal. The 
pluripotent stem cell program is driven by the "core" tran- 
scription factors Oct4, Sox2 and Nanog which form a stable 
autoregulatory loop that can also poise the cell for differen- 
tiation [1]. The pluripotent state also displays characteristic 
epigenetic marks, including bivalent histone domains on 
developmental regulators [2] and prominent non-CpG meth- 
ylation throughout the genome [3]. 

Histone modifications can be either activating or repres- 
sive depending on the location and the chemical mark. Two 
canonical modifications are trimethylation at lysine 27 of 
histone H3 (H3K27me3), which is a repressive mark cata- 
lyzed by Polycomb Repressive Complex 2 (PRC2), and 
trimethylation at lysine 4 of histone H3 (H3K4me3), which 
is an activating mark catalyzed by the Trithorax family of 
proteins [4]. H3K27me3 and H3K4me3 co-localize on a sig- 
nificant number of developmental regulators in ES cells, 
forming the distinctive "bivalent domain" which poises these 
gene loci for activation upon differentiation [5]. 
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DNA methylation was thought to be responsible for long- 
term and stable repression of gene expression and has been 
implicated in X-inactivation, genomic imprinting and retro- 
transposon silencing [6] . In adult somatic cells, DNA methy- 
lation and the resultant 5-methylcytosine (5mC) mark pre- 
dominantly occur at the cytosine of CpG dinucleotides, but 
in ES cells near a quarter of all DNA methylation happens in 
a non-CpG context [3]. Recently, global demethylation 
events have been characterized in the zygote and developing 
primordial germ cells [7-10], and extensive turnover of loci- 
specific methylation has been observed during cellular dif- 
ferentiation [3, 11], changing our conception of DNA methy- 
lation into that of a more dynamic mark. 

The discovery of the ten-eleven translocation (Tet) fam- 
ily of DNA hydroxylases revolutionized the epigenetics 
field, sparking an incredibly rapid investigation of possible 
DNA demethylation pathways mediated by Tet and of the 
protein family's functional significance in multiple important 
gene regulation contexts, including cancer and ES cells. 
There are three members within the Tet family of proteins, 
namly Tetl, Tet2 and Tet3, and this review will primarily 
focus on the first two members, which are upregulated in the 
pluripotent state and downregulated during differentiation 
[12]. While many details about the regulation of DNA meth- 
ylation are still unknown, an exciting discovery involves the 
novel dual functions of Tetl occupancy on specific gene loci 
in the ES cell [13-15]. These recent studies link DNA modi- 
fication status with that of the surrounding histone marks, 
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and this crosstalk is important to keep in mind for future 
manipulations of gene regulation, whether in the context of 
induced pluripotent stem (iPS) cells or cancer treatment. 

ROLE OF TET IN DNA DEMETHYLATION 

Tetl is a Fe(II)- and alpha-ketoglutarate (a-KG)- 
dependent DNA hydroxylase that was first discovered as a 
fusion partner of the mixed lineage leukemia (MLL) gene in 
a rare form of acute myeloid leukemia (AML) containing the 
t(10; Il)(q22;q23) translocation [16, 17]. Tetl and Tet2, as 
well as Tet3, became subjects of intense research once they 
were found to have catalytic activity on 5-methylcytosine 
(5mC) [18], converting the initial cytosine modification to 5- 
hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 
5-carboxylcytosine (5caC) through successive oxidation 
steps [10, 19-21]. 5hmC accumulates in cells at appreciable 
levels of around a hundred to several thousand modified 
bases per million unmodified C, but 5fC and 5caC levels are 
barely detectable, even in cell types enriched with the Tet 
family of proteins [21]. However, accumulation of 5caC lev- 
els can be observed when thymine-DNA glycosylase, re- 
sponsible for excision of 5caC, is depleted in mouse embry- 
onic stem cells [21]. 

These new cytosine modifications are widely believed to 
be the first steps of active DNA demethylation, but there has 
only been one study showing a robust mechanism for replica- 
tion-independent demethylation [22]. There are currently three 
theories about how active demethylation can be completed 
from Tet-mediated cytosine modifications [23] see Fig. (1). 
First, 5fC or 5caC could be excised by thymine DNA glycosy- 
lase (TDG) [20] and subsequently replaced with unmodified C 
through the Base Excision Repair (BER) pathway. Second, 
5hmC could be deaminated by the activation induced 



deaminase/apolipoprotein B editing complex (AID/APOBEC) 
family of cytidine deaminases to form 5-hydroxymethyluracil 
(5hmU), which would then also be excised by a DNA glyco- 
sylase and subject to BER [24]. Third, 5caC could be directly 
decarboxylated to generate unmodified cytosine, though there 
is yet no direct evidence of decarboxylase. The second 
mechanism involving AID/ APOBEC and mediated by Tetl 
has been observed in vivo in the adult mouse brain, though the 
research group did not rule out the presence of other contribut- 
ing mechanisms [22]. AID deficiency also weakens global 
demethylation in mouse primordial germ cells [25]. However, 
the two BER pathways are considered unlikely to be solely 
responsible for the global demethylation events seen in em- 
bryonic development, as large-scale BER would put excessive 
strain on genome stability [26]. Furthermore, a recent study 
has revealed no detectable deamination of 5hmC by 
AID/APOBEC in vitro, which the authors attribute to 5hmC's 
increased steric bulk compared to cytosine and 5mC [27]. This 
study challenges the viability of the AID/APOBEC demethy- 
lation pathway. The search for a 5caC decarboxylase - or an- 
other yet-undiscovered mechanism for genome- wide demethy- 
lation - will undoubtedly continue in the future. 

A pathway for passive demethylation using 5hmC has also 
been elucidated. On the basis of findings from others that 
Tet3-mediated oxidation of 5mC to 5hmC is important for 
global demethylation of the paternal genome in zygotes and 
preimplantation embryos [28-30], Inoue et al. suggested that 
loss of 5hmC during preimplantation is likely a DNA replica- 
tion-dependent passive process by use of an immuno staining 
approach [31, 32]. Nonetheless, caution should be exercised, 
because immuno staining is not quantitative and therefore it is 
possible that only a portion of 5mC is converted to 5hmC 
whereas the remaining portion of 5hmC is removed by an al- 
ternative pathway [33]. 
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Fig. (1). Dual roles of Tetl in embryonic stem cells. (A) The conventionally recognized role of Tetl as a transcriptional activator. Even 
though the pathways for active demethylation have not been fully elucidated, 5hmC, 5fC and 5caC are still generally accepted as intermedi- 
ates in the demethylation pathway. (B) The novel role of Tetl as a transcriptional repressor. Tetl indirectly mediates binding of EZH2 to 
gene promoters to create the repressive H3K27me3 mark, leading to heterochromatin and repression of gene expression. 



Tet in ES Cells 

ROLE OF TET IN EMBRYONIC STEM CELLS 

The role of 5hmC and the Tet family of proteins in ES 
cell pluripotency, self-renewal and lineage specification has 
been discussed ever since the function of Tetl was first dis- 
covered [18]. Both Tetl and Tet2 are upregulated in ES cells 
- though Tetl shows up to 5-fold higher expression than 
Tet2 - and their expression levels drop after induced differ- 
entiation; similarly, 5hmC is also enriched in ES cells and 
downregulated during development [18, 34, 35]. Tetl and 
Tet2 have been shown to be binding targets of Oct4 [34], 
incorporating the proteins into well-characterized pluripo- 
tency machinery, but the necessity of Tetl/2 to pluripotency 
maintenance is still unclear. Several groups have shown sig- 
nificant loss of ES cell morphology after Tetl knockdown, 
coinciding with a decrease in 5hmC levels and downregula- 
tion of Nanog through hypermethylation of its promoter [12, 
18]. Other groups have observed bias in lineage specification 
after knockdown of Tetl or Tet2, skewing ES cells towards 
trophoectoderm or mesoendoderm commitment by upregu- 
lating specific differentiation genes such as Cdx2, Gata6, 
Eomes and Elf5 [12, 34]. Interestingly, Koh et al. found that 
Tetl and Tet2 showed antagonistic effects against each other 
at several developmental regulators, suggesting that, despite 
having similar function, the proteins may be responsible for 
two different lineage specifications [34]. Koh et al. [34] also 
saw no effect on Nanog expression or the stem cell pheno- 
type after Tetl or Tet2 knockdown, an observation that was 
verified by other groups [15]. Similarly, Dawlaty et al. [36] 
were able to generate Tetl knockout ES cells that retained 
pluripotency and in fact developed into knockout pups. They 
also generated viable Tetl knockout mice through crosses of 
heterozygous Tetl +/ " mice, though the mutant progeny were 
slightly smaller in body size than wild-type mice [36]. It is 
possible that the effects of Tetl knockout were compensated 
for by Tet2 activity, and it will be important to generate dou- 
ble knockout ES cells and mice in order to fully test for the 
role of Tet proteins in pluripotency. 

The molecular mechanisms behind Tetl and Tet2's func- 
tions in ES cells are not clearly proven, though, surprisingly, 
demethylation of bound loci is not the only effect of Tetl 
occupancy. Many genome- wide studies of Tet, 5mC and 
5hmC occupancy have been conducted in order to better 
understand the functions and interactions of these epigenetic 
marks [13, 15, 23, 37-42], and new techniques have been 
created to detect 5hmC on genome-wide scale [43, 44] and 
more recently also at single-base resolution [45-47]. 

A majority of the genome-wide studies discovered new 
complexities in Tetl and 5hmC signaling in ES cells. First, 
Tetl and 5hmC did not colocalize as extensively as would be 
expected from their association, though both were clustered 
in gene-rich areas of the genome [15, 26, 37]. Tetl binds 
DNA through its N terminus CXXC domain, which has been 
shown to bind preferentially to unmodified, CpG rich DNA 
[15]. In light of this binding preference, it is not surprising 
that Tetl is heavily enriched at high CpG promoters and ex- 
ons [15, 26, 37], which have been previously associated with 
low DNA methylation [48]. However, 5hmC was shown to 
be excluded from high CpG density promoters, even though 
the mark is enriched according to CpG density within the 
gene body [15]. One group found only a 30% overlap of 
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Tetl and 5hmC peaks within the ES genome, distributed 
roughly equally between promoter, intron, exon and inter- 
genic regions [15]. At these overlapping regions, Tetl 
knockdown downregulates 5hmC levels and upregulates 
5mC levels, but Tetl knockdown does not affect 5hmC lev- 
els at other gene loci [15]. The disparity between Tetl and 
5hmC loci does not, by itself, prove a new function for either 
Tetl or 5hmC. Tetl could be acting on 5mC without fully 
binding to the DNA, and 5hmC could be too transient an 
intermediate between 5mC and further oxidation derivatives 
to be detected at Tetl -bound loci. However, when looking 
for differential gene expression between control and Tetl- 
knockdown cells, researchers observed both Tetl -activated 
and Tetl -repressed targets, direct evidence for a novel re- 
pressive function of Tetl [13-15, 26]. In fact, both groups 
observed a greater number of Tetl -repressed targets, i.e. 
genes that are upregulated upon Tetl knockdown, than Tetl- 
activated targets, or genes that are downregulated upon Tetl 
knockdown [13-15, 26]. This finding, combined with other 
evidence, suggested a new mechanism of gene expression by 
Tetl outside of hydroxymethylation. 

REGULATION OF iPS CELLS BY TET2 

An intriguing area of interest is the study of induced 
pluripotent stem cells (iPS) and the ability of these cells to 
be used in science and medicine. These cells are generated 
by reprogramming somatic cells by using pluripotency fac- 
tors Oct4, Sox2, Klf4 and c-MYC (referred to as OSKM) 
[49]. Very recently, Doege et al. found reprogramming by 
OSKM led to Parpl (ADP-ribose polymerase-1) and Tet2 
recruitment to the loci of Nanog and Esrrb, previously 
known pluripotency loci [50]. Although somatic epigenetic 
signatures are lost in iPSC reprogramming [51] and both 
genes were previously indicated in epigenetic remodeling, 
this was the first evidence for their role in the reprogram- 
ming of pluripotency loci. They found that Tet2, but not Tetl 
or Tet3, was significantly induced in somatic cells trans- 
formed with OSKM both at early reprogramming stages and 
at the iPSC stage. Moreover, loss of Parpl or Tet2 led to 
decreased chromatin-active histone modifications at pluripo- 
tency loci. Reduction of Tet2 blocked induction of hy- 
droxymethylation at pluripotency loci while methylation 
changes varied. This, combined with the early induction of 
hydroxymethylation at these loci, suggests that 5hmC poten- 
tially serves as a distinct epigenetic mark from 5mC. To- 
gether these results reveal a new function of Tet2 in the in- 
duction of pluripotency as well as suggesting that hy- 
droxymethylation potentially acts outside the demethylation 
pathway and might promote chromatin remodeling [50]. 

THE POLYCOMB REPRESSIVE COMPLEX IN EM- 
BRYONIC STEM CELLS 

Another key component to the control of ES cell differ- 
entiation is the regulation of the Polycomb Repressive Com- 
plexes 1 and 2 (i.e., PRC1 and PRC2), with a majority of the 
existing scientific research focused on PRC2. The PRC1/2 
complexes create repressive histone marks on their target 
genes, thus blocking expression of these genes. The repres- 
sive marks known to be catalyzed by PRC1 and PRC2, to 
date, are monoubiquitylation of histone H2AK119 (H2AK 
119ul) and dimethylation and trimethylation of histone 
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H3K27 [52-58]. While the Polycomb complexes have been 
known for several years to be important regulators of cell 
fate decision [59, 60], it was not until recently that this com- 
plex was clearly tied to the maintenance and differentiation 
of ES cells [61]. In 2006, several groups showed simultane- 
ously that Polycomb regulated targets were transcriptionally 
repressed in both human and mouse ES cells and reactivated 
during ES cell differentiation [62-64]. Since then, more evi- 
dence has accumulated that highlights the importance of 
these complexes in the regulation of embryonic stem cells. 
For example, knockout of SUZ12, EZH2, PCL2, EED or 
JAPJD2, all components of PRC2, in embryonic stem cells 
results in the inability of these stem cells to undergo proper 
differentiation and the expression of higher levels of pluripo- 
tency markers (reviewed in [65]). Those studies have dem- 
onstrated that PRC1 and PRC2 are essential components of 
ES cell maintenance and differentiation. 

TET1 INTERACTION WITH THE POLYCOMB RE- 
PRESSIVE COMPLEX IN EMBRYONIC STEM 
CELLS 

While it was known that both Tetl expression and PRC1 
and PRC2 regulation were required to maintain embryonic 
stem cells, there was little to no functional connection be- 
tween Tetl and PRC 1/2 until very recently. In 2011, Wu et 
al. [13, 14, 23, 26] connected Tetl -repressed targets to the 
recruitment of a repressive histone mark, H3K27me3, to the 
gene promoter. They also discovered that Tetl knockdown 
substantially compromises the ability of Ezh2 to bind to 
PRC2/Tetl cobound gene loci in ES cells [13, 14, 23, 26] see 
Fig. (1). Importantly, Nanog overexpression in Tetl -knock 
down cells cannot fully rescue Ezh2 binding to its target loci, 
showing that Tetl's effects on Ezh2 do not result from dis- 
ruption of core pluripotency machinery [13, 14, 23, 26]. No 
stable interactions have yet been observed between Tetl and 
Ezh2, and Tetl knockdown does not reduce PRC2 expres- 
sion or stability [13, 14, 23, 26], producing an intriguing 
question - how does Tetl affect PRC2 binding without di- 
rect interaction with its subunits? 

FUTURE WORK 

There is still much we do not know about DNA modifi- 
cations and about epigenetic regulation in general. New 
techniques have been developed to distinguish 5hmC from 
5mC, even at single-base resolution [45-47], but these meth- 
ods need to be further refined to be able to determine all lay- 
ers of DNA modifications (5mC, 5hmC, 5caC and 5fC) at a 
single cell level. Once detection methods for all epigenetic 
marks and protein-DNA binding have reached this degree of 
resolution, we will be able to better understand how crosstalk 
interactions occur between epigenetic regulatory complexes 
and their resultant marks. 

More work is also required to better understand the con- 
nection between reading, writing and interpreting the histone 
code, both on its own and as it relates to other epigenetic 
factors including methylation/hydroxymethylation. Specifi- 
cally in ES cells, a majority of the research so far has fo- 
cused on the function and localization of Tetl, and Tet2 
must be incorporated into this regulatory picture. Other epi- 
genetic mechanisms should also be integrated, including the 



further oxidation derivatives of 5hmC, other histone modifi- 
cations and ES cell-specific non-coding RNAs. This future 
work will provide a better understanding of stem cell pluri- 
potency and of crosstalk between the different mechanisms 
of gene regulation. 
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ABBREVIATIONS 

ES = Embryonic Stem 

Tet = Ten-Eleven Translocation 

PRC = Polycomb Repressive Complex 

H3K27me3 = Trimethylation At Lysine 27 of Histone H3 

H3K4me3 = Trimethylation At Lysine 4 of Histone H3 

5mC = 5-Methylcytosine 

5hmC = 5-Hydroxymethylcytosine 

5fC = 5-Formylcytosine 

5caC = 5-Carboxylcytosine 

iPS = Pluripotent Stem 

MLL = Mixed Lineage Leukemia 

AML = Acute Myeloid Leukemia 

TDG = Thymine DNA Glycosylase 

BER = Base Excision Repair 

AID = Activation Induced Deaminase 

APOBEC = Apolipoprotein B Editing Complex 
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